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Abstract 

Background: Protein-RNA interactions are integral components of nearly every aspect of biology, including 
regulation of gene expression, assembly of cellular architectures, and pathogenesis of human diseases. However, 
studies in the past few decades have only uncovered a small fraction of the vast landscape of the protein-RNA 
interactome in any organism, and even less is known about the dynamics of protein-RNA interactions under 
changing developmental and environmental conditions. 

Results: Here, we describe the gPAR-CLIP (global photoactivatable-ribonucleoside-enhanced crosslinking and 
immunopurification) approach for capturing regions of the untranslated, polyadenylated transcriptome bound by 
RNA-binding proteins (RBPs) in budding yeast. We report over 13,000 RBP crosslinking sites in untranslated regions 
(UTRs) covering 72% of protein-coding transcripts encoded in the genome, confirming 3' UTRs as major sites for 
RBP interaction. Comparative genomic analyses reveal that RBP crosslinking sites are highly conserved, and RNA 
folding predictions indicate that secondary structural elements are constrained by protein binding and may serve 
as generalizable modes of RNA recognition. Finally, 38% of 3' UTR crosslinking sites show changes in RBP 
occupancy upon glucose or nitrogen deprivation, with major impacts on metabolic pathways as well as 
mitochondrial and ribosomal gene expression. 

Conclusions: Our study offers an unprecedented view of the pervasiveness and dynamics of protein-RNA 
interactions in vivo. 



Background 

A diverse and expanding repertoire of RNA-binding 
proteins (RBPs) ensures faithful expression and function 
of substrate mRNAs [1-3]. Many RNAs are organized by 
RBPs and other protein co-factors into higher-order 
ribonucleoprotein (RNP) assembUes that fulfill critical 
functions in storage, transport, inheritance, and degrada- 
tion of RNA [4,5]. For example, over 70% of mRNAs in 
Drosophila embryos are localized to distinct organelles, 
compartments, and membrane interfaces, providing a 
means for directing local translation and regulating cel- 
lular architectures and functions [6]. RNA and RBPs can 
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also reversibly aggregate into granules to allow RNA 
storage and decay in response to stimuli [7,8]. These 
and many other processes are driven by large, complex 
networks of protein-RNA interactions that provide spe- 
cificity in gene regulation and fidelity in RNP assembly. 
Despite important insights regarding the necessity of 
RNA regulation for cellular functions, the RBP-RNA 
interactome and its response to changing cellular condi- 
tions have yet to be fully elucidated. 

Studies of RBP-RNA interactions have historically 
relied on the identification of target transcripts bound by 
individual RBPs. In vitro selection of RNA sequences that 
bind RBPs with high affinity (systematic evolution of 
ligands by exponential enrichment, SELEX) can identify 
primary sequence recognition elements. For example. 
Nova proteins, which regulate mRNA splicing in neu- 
rons, recognize the RNA consensus sequence YCAY [9], 
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and Y box-binding protein-1, a member of the cold 
shock/Y box domain protein family, recognizes a CAYC 
RNA motif [10]. Yet these and other primary sequence 
elements identified in vitro are generally short and 
degenerate and appear too frequently in the transcrip- 
tome to be useful for in silico target identification. Micro- 
array profiling of transcripts that co-purify with 
interacting proteins (RIP-Chip) has been widely used to 
detect transcripts stably associated with RBPs, such as 
mRNAs bound by translational components HuB, elF- 
4E, and PABP in P19 embryonal carcinoma stem cells 
[11]. Similarly, RIP-Chip experiments have identified 
mRNAs associated with 40 yeast RBPs and uncovered a 
set of potential RBP recognition motifs [12,13], some of 
which were validated in vitro using SELEX [14]. Although 
capable of identifying mRNA targets for select RBPs, RIP- 
Chip is prone to artifacts, including RBP-RNA dissocia- 
tion and re-association after cell lysis [15], isolation of 
non-specific RNAs, and indirect binding through other 
co-purified RBPs [16]. In addition, RIP-Chip cannot 
detect transient interactions or resolve the exact RBP 
binding sites on identified transcripts. 

To identify transcriptome-wide footprints of RBPs in 
vivo, UV crosslinking has been coupled with immuno- 
purification of RBPs (CLIP) [17,18]. CLIP takes advan- 
tage of the photoreactivity of RNA bases, most often 
pyrimidines, with interacting amino acid side chains 
upon 254 nm UV irradiation [19]. The formation of 
covalent linkages allows stringent purification of RBP- 
RNA complexes and subsequent identification of cross- 
Unked RNA fragments via cDNA sequencing. Recently, 
a modified CLIP technique, PAR-CLIP (photoactivata- 
ble-ribonucleoside-enhanced CLIP), has been introduced 
in which photoactivatable-ribonucleoside analogs are 
incorporated into the transcriptome in live cells to 
enable efficient crosslinking using 365 nm UV irradia- 
tion [20]. Recent studies employing CLIP in mouse 
brain [21] and Caenorhabditis elegans [22] and PAR- 
CLIP in human embryonic kidney cells [20] have suc- 
cessfully decoded in vivo microRNA-mRNA interactions 
by identifying RNAs bound to Argonaute, a main com- 
ponent of the microRNA-induced silencing complex. 
PAR-CLIP has also been implemented to elucidate the 
regulatory mechanisms of human antigen R (HuR) pro- 
tein, which stabilizes gene expression by binding to AU- 
rich elements [23,24], and to identif)^ the transcriptome- 
wide distribution of non-poly(A) termination factors in 
yeast [25]. In addition to enabling efficient crosslinking, 
PAR-CLIP generates frequent and non-random nucleo- 
tide substitutions at crosslinking sites to reveal specific 
RBP-RNA contact sites with nucleotide resolution. 

Until recently, CLIP and PAR-CLIP have been limited 
to investigation of individual RBPs. Two recent studies 
introduced the use of photoactivatable-ribonucleoside- 



enhanced UV crosslinking with oligo(dT) pull-down of 
mRNAs followed by tandem mass spectrometry to glob- 
ally identify mRNA-binding proteins in human cell lines 
[26,27]. In addition to identifying known RBPs, these 
studies identified 315 [26] and 245 [27] novel RBPs that 
lack canonical RNA-binding domains and functional 
annotation as RNA-binding proteins. Castello et al. [26] 
found that RBP amino acid sequences are more disor- 
dered than those of non-RBPs and identified potential 
new classes of RNA-binding domains. Baltz et aL [27] 
additionally captured and sequenced protein-bound 
mRNAs, providing a transcriptome-wide map of poten- 
tial c/5-regulatory elements. 

Despite recent advances towards understanding global 
RBP-RNA interactions, the dynamic nature of these 
associations in vivo and the general principles driving 
these associations remain unexplored. Here, we adapt 
the PAR-CLIP technique to map all RBP binding sites 
across the yeast non-translating mRNAs in different 
environmental conditions, a method we call global PAR- 
CLIP (gPAR-CLIP). The comprehensive identification of 
RBP-RNA crosslinked sites visualized by gPAR-CLIP 
allows us to derive general properties of RBP-RNA 
interactions in vivo. Additionally, we compared RBP- 
RNA crosslinked sites in rapidly proliferating versus 
stress-treated cells and observed large-scale changes in 
RBP-RNA interactions, providing a starting point for 
dissecting the network of post-transcriptional gene regu- 
latory mechanisms underlying stress response. 

Results 

gPAR-CLIP Identifies transcriptome-wide RBP crosslinking 
sites 

To construct a global map of RBP binding sites on the 
transcriptome in vivo, we combined PAR-CLIP with 
high-throughput sequencing (Figure la; Materials and 
methods). Briefly, we metabolically incorporated the 
photoactivatable-nucleobase analog 4-thiouracil (4sU) in 
growing yeast and used UV irradiation to crosslink 4sU 
to juxtaposed proteins, 'freezing' protein-RNA interac- 
tions in vivo. Next, we implemented three biochemical 
strategies to capture RNA regions bound by the pro- 
teome: (i) sucrose gradient centrifugation to reduce ribo- 
some abundance; (ii) oligo(dT) selection to deplete 
abundant structural non-coding RNAs (for example, 
rRNAs); and (iii) chemical biotinylation of proteins. We 
exploited the high-affinity streptavidin-biotin interaction 
to purify all biotin-protein-RNA complexes with high 
efficiency and stringency. After trimming unbound RNA, 
RBP-protected fragments were ligated to linkers, con- 
verted to cDNAs, and subjected to lUumina high- 
throughput sequencing (Materials and methods). We 
term this global PAR-CLIP method 'gPAR-CLIP'. There 
are two caveats associated with our gPAR-CLIP protocol. 
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Figure 1 gPAR-CLIP identifies transcriptome-wide RBP crosslinl^ing sites, (a) Schematic of the gPAR-CLIP protocol, (b) Reproducibility of 
crosslinking sites generated from replicate gPAR-CLIP libraries prepared from yeast grown in synthetic defined media (abbreviated as WT gPAR- 
CLIP hereafter). Pearson correlation coefficient is indicated. Inset: distribution of log2 crosslinking site RPM ratios between replicates. Replicate 
error c = 1.3-fold, (c) Length distribution of crosslinking sites in WT gPAR-CLIP libraries. Dotted line: average crosslinking site length of 23 
nucleotides, (d) Pearson correlation coefficients of total mRNA-seq and gPAR-CLIP read coverage between 5' UTR, CDS, and 3' UTR regions as 
well as correlation coefficients of ribosome depleted (-ribosome) mRNA-seq and gPAR-CLIP read coverage between replicate WT libraries. 



First, during crosslinking (approximately 5 minutes), the 
cells were in nutrient-free buffer and incubated on ice, 
which could trigger changes in RBP binding. Second, we 
limited our analysis to mRNAs from the top of the 
sucrose gradient, which mostly consist of non-translating 



mRNAs, so our conclusions apply to non-translating 
mRNAs. 

To define the dynamic landscape of RBP-RNA interac- 
tions, we constructed duplicate gPAR-CLIP and mRNA- 
seq libraries, including incorporation of 4sU, for the 
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wild-type Saccharomyces cerevisiae strain cultured in 
complete media or subjected to glucose or nitrogen star- 
vation for 2 hours. An average of 10 million reads were 
sequenced from each gPAR-CLIP library. Of the 72% of 
reads that mapped uniquely to the genome, over 70% 
contained one or two T-to-C conversion events, the sig- 
nature substitution induced by 4sU crosslinking (Addi- 
tional file 1). From overlapping reads, we derived 
clusters representing RNA regions crosslinked to pro- 
teins (Additional files 2 and 3; Materials and methods). 
Crosslinking site read coverage was normalized to 
mRNA expression levels calculated as reads per million 
mapped reads per kilobase of transcript (RPKM). 
Because our approach captures protein-RNA interac- 
tions for potentially all RBPs, we cannot rule out the 
possibility that some clusters are located proximally to 
true RBP-binding sites [28]; therefore, we refer to our 
gPAR-CLIP read clusters as 'crosslinking sites'. We 
empirically assigned a false discovery rate (FDR) to each 
crosslinking site by deriving clusters from mRNA-seq 
reads with one or two T-to-C mismatches representing 
sequencing error and comparing the T-to-C conversion 
rate of these clusters to those derived from gPAR-CLIP 
reads (Materials and methods). Using a 1% FDR thresh- 
old, we reproducibly identified 80,883 crosslinking sites 
that are, on average, 23 nucleotides long (Figure lb,c): 
65,992 in protein-coding sequences (CDSs), 4,508 in 5' 
UTRs, 8,525 in 3' UTRs, and 818 in introns (Additional 
files 3 and 4). Of 6,717 annotated protein-coding tran- 
scripts, 6,228 (93%) have at least one crosslinking site. 
Because CDS crosslinking sites exhibited three-nucleo- 
tide periodicity, a hallmark of ribosome binding (Addi- 
tional file 5), we separately analyzed CDS, 5' UTR, and 
3' UTR crosslinking sites. We observed high correlation 
between gPAR-CLIP read coverage of 5' UTRs and 
CDSs (Pearson correlation coefficient, R^ = 0.45), sup- 
porting a prominent role for 5' UTRs in translational 
regulation (Figure Id). However, gPAR-CLIP read cover- 
age of 3' UTRs correlated poorly with both 5' UTRs (R^ 
= 0.20) and CDS (R^ = 0.21), suggesting a greater role 
in post-transcriptional regulation. As expected, correla- 
tion between total mRNA-seq read coverage across all 
transcript regions was approximately equal (Figure Id; 
Additional file 6). We also observed very high correla- 
tion of gPAR-CLIP and mRNA-seq read coverage 
between replicates over each genie region (Figure Id), 
reflecting low technical variation between replicates. 

gPAR-CLIP captures known and novel crosslinking sites 

To assess the performance of gPAR-CLIP in capturing 
known RBP-RNA interactions, we evaluated its ability to 
identify binding sites of Puf3p, a Pumilio-family RBP, 
which we derived from conventional PAR-CLIP using a 
strain expressing a TAP-tagged Puf3p fusion protein. Of 



the 1,236 Puf3p binding sites confidently identified by 
PAR-CLIP, 1,008 (82%) were also captured by gPAR- 
CLIP (Figure 2a; Additional file 7); for example, the two 
functionally validated Puf3p binding sites in the COXl 7 
3' UTR were identified by both Puf3p PAR-CLIP and 
gPAR-CLIP (Figure 2b) [29,30]. It is possible that other 
Puf proteins with similar RNA recognition motifs are 
binding at these sites in our gPAR-CLIP libraries [31], 
reflecting that our protocol does not distinguish the 
RBPs associated with each crosslinking site. From our 
Puf3p PAR-CLIP library, we also identified 560 novel 
Puf3p mRNA targets harboring a binding site containing 
a Puf3p recognition motif (Additional files 8 and 9). 
Given the high recovery of Puf3p sites by gPAR-CLIP, 
we conclude that gPAR-CLIP faithfully captures binding 
sites of a known RBP. 

We next examined our data for general RBP-RNA 
interaction signatures related to mRNA maturation and 
translational regulation. gPAR-CLIP read coverage of 5' 
UTRs peaked within 75 nucleotides downstream of 
annotated transcription start sites but was reduced 
when yeast were grown in media lacking glucose or 
nitrogen (Figure 2c). This coverage likely reflects RBP- 
RNA interactions involved in translation initiation, and 
the decrease in coverage is consistent with decreased 
translation initiation that occurs during cellular stress 
[32-34]. gPAR-CLIP also effectively captured the spliceo- 
some binding pattern by identifying intronic RBP cross- 
linking sites clustering 3' of the lariat branch point (BP) 
bound by the U2 snRNP (Figure 2d). These crosslinking 
sites contain the canonical BP-binding protein recogni- 
tion sequence UACUAAC [35,36]. Consistent with 
stress-induced transcriptional repression of ribosomal 
subunits [37], which account for 18% of all protein-cod- 
ing genes with introns, gPAR-CLIP read coverage at the 
lariat BPs of ribosome-encoding mRNA introns 
decreased upon glucose and nitrogen deprivation. 
Finally, a strong RBP crosslinking signature was identi- 
fied approximately 20 nucleotides upstream of the most 
prominent poly(A) junction site identified in each 3' 
UTR (Figure 2e), consistent with interactions with the 
polyadenylation complex [38]. Taken together, these 
results indicate that gPAR-CLIP faithfully captures 
diverse RBP-RNA interactions along the discrete anat- 
omy of mRNAs. 

RBP crosslinking sites exhibit global conservation in both 
primary sequences and secondary structures 

Compared to mRNA-seq reads, which were equally distrib- 
uted among 5' and 3' UTRs and CDSs, gPAR-CLIP reads 
were 4-fold enriched in 3' UTRs, 2.5-fold enriched in 5' 
UTRs, and 4-fold depleted in CDSs compared to mRNA- 
seq reads (Figure 3a). To examine RBP binding activity at 
nucleotide resolution, we calculated a crosslinking score 
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Figure 2 gPAR-CLIP captures known RBP crosslinking signatures, (a) Overlap of crosslinking sites identified in PufSp PAR-CLIP and wild-type {WT) 
gPAR-CLIP. PufSp PAR-CLIP crosslinking sites with >l%T-to-C conversion rate (Additional file 8) were considered captured by gPAR-CLIP if at least 50% 
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51 annotated ribosomal gene introns centered at the branch point (BP) S' end (d), and 4,241 S' UTRs centered on the poly(A) junction (e). 



(CLS) for each T in the genome (U in the transcriptome) 
as the ratio of gPAR-CLIP reads containing one or two T- 
to-C conversion events to mRNA-seq reads to normalize 
for variable mRNA abundance (Additional file 10; Materi- 
als and methods). There were 378,247 Ts (12.7% of tran- 
scriptomic Us) assigned a CLS: high CLS values indicate 
high crosslinking efficiency and strong RBP-RNA interac- 
tions; low CLS values indicate low crosslinking efficiency 
or weak/transient RBP-RNA interactions. Consistent with 
the distribution of gPAR-CLIP reads, CLS values were 
highest in 3' UTRs followed by 5' UTRs and CDSs (Figure 
3b; Additional file 11). These observations support 3' UTRs 
as the primary sites for RBP-RNA interactions for non- 
translating mRNAs. To determine if enrichment of gPAR- 
CLIP reads on UTRs was biased because of the U-richness 
of UTRs, we compared the proportion of Us in each cross- 
linking site to its coverage in gPAR-CLIP and observed 
only a weak positive correlation, which by itself cannot 



account for the four-fold enrichment of gPAR-CLIP reads 
on UTRs (Figure 3c). 

A previous comparative analysis of seven Saccharomyces 
genomes revealed that approximately 14% of evolutionarily 
constrained bases lie outside protein-coding regions, often 
located in UTRs [39]. These conserved regions could 
represent functional elements interacting with c/5-acting 
factors. We found direct evidence of RBPs crosslinking to 
35% of conserved sequence blocks in UTRs as defined by 
phastCons, a score representing the likelihood that a base 
falls in a conserved element (Figure 3d): 405 of 1,549 5' 
UTR blocks (26%) and 1,036 of 2,536 3' UTR blocks (41%) 
completely overlap with at least one RBP crosslinking site, 
which is significantly higher than randomly defined con- 
trol blocks (x^ test, P < 10'^^^ for 3' UTR and P < 10"^^ for 
5' UTR). 

At the gene level, ATG8, a key autophagy gene, con- 
tains two major crosslinking sites that overlap with 
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Figure 3 RBP crosslinking sites exhibit global sequence conservation, (a) Average ribosome depleted mRNA-seq and gPAR-CLIP read 
distributions across 5' UTR, CDS, and 3' UTR regions for all libraries. Error bar: 1 standard deviation. RPKM: reads per million mapped reads per 
kilobase. (b) Cumulative distribution of CLS values from wild-type {WT) libraries, (c) Proportion of Ts in crosslinking site binned by crosslinking 
site coverage (reads per million mapped reads (RPM)). The dashed red line indicates average T content of all crosslinking sites, (d) Number of 
conserved blocks in 3' and 5' UTRs overlapping 100% with WT gPAR-CLIP crosslinking sites (x^ P-values indicated). Control blocks were randomly 
generated within 3' and 5' UTRs to match the number and size of conserved blocks, (e) Two major gPAR-CLIP crosslinking sites in ATG8 3' UTR 
(top) and TOM40 3' UTR (bottom) overlapping conserved blocks, (f) Mean phastCons scores for Ts ranked and binned by CLS. Control lines 
represent mean phastCons scores of randomly ranked and binned Ts with no CLS, repeated ten times. 
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conserved sequence blocks in its 3' UTR (Figure 3e, top; 
Additional file 11). Similarly, TOM40, which encodes a 
translocase that mediates import of mitochondria-loca- 
lized proteins into the mitochondria, contains two major 
3' UTR crosslinking sites in regions with high local con- 
servation (Figure 3e, bottom; Additional file 11). To 
further elucidate the connection between RBP binding 
and conservation, we binned Ts by CLS values and 
observed that Ts in all 3' and 5' UTR bins, as well as the 
majority of CDS (78%) bins, were more conserved than 
randomly binned Ts, suggesting that RBP crosslinking 



sites are under purifying selection (Figure 3f; Additional 
file 11). 

Unexpectedly, 3' and 5' UTR nucleotides in the lowest 
CLS bins exhibited extremely high conservation. Since a 
low CLS can indicate inefficient RNA capture, and 
gPAR-CLIP inefficiently captures highly structured, dou- 
ble-stranded RNA (see Discussion), we hypothesized 
that low CLS/high conservation bins represent con- 
served, secondary structure motifs recognized by RBPs. 
For example, She2p binds a distinct stem-loop structure 
in several bud-localized mRNAs [40,41], including 
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ASHl, for which the She2p 3' UTR recognition element 
is weakly represented in our gPAR-CLIP dataset. To 
determine if Ts with low CLS values are located in RNA 
regions with a high degree of secondary structure, we 
computed the probability of each T being unpaired 
using RNAplfold, a local thermodynamic folding algo- 
rithm [42], We observed that Ts with low CLS values 
exhibited low unpaired probabilities, suggesting they are 



more likely to exist in double-stranded structures 
(Figure 4a; Additional file 12). Additionally, a strong, posi- 
tive correlation between unpaired probability and CLS 
values indicates that unpaired regions crosslink more 
strongly to RBPs. To probe RNA secondary structures 
more accurately, we extended the boundaries of each cross- 
linking site to span 80 nucleotides and calculated the most 
thermodynamically stable secondary structure. Consistent 
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Figure 4 RBP crosslinking sites share global structural characteristics, (a) Mean unpaired probability scores for Ts ranl<ed and binned by 
CLSs. Control lines represent mean unpaired probability of randomly ranked and binned Ts with no CLS. Pearson correlation coefficients: 5' UTR 
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with the per nucleotide analysis, crosslinking sites with low 
CLS values formed predominantly double-stranded RNA 
structures (Figure 4b; Additional file 12). 

Secondary structures tolerate substitutions that preserve 
base pairing in stem regions, a characteristic known as 
covariance. To identify conserved and thermodynamically 
stable RNA secondary structures using a covariance 
model, the seven yeast genomes were scanned with RNAz 
[43,44], and a small set of potential structural elements 
was identified: 843 in CDS, 25 in 5' UTRs, and 51 in 3' 
UTRs. Among Ts assigned a CLS, those with the lowest 
CLS values in CDS and 3' UTRs were preferentially 
located in conserved, structural elements compared to 
control elements (Figure 4c; Additional file 12), Taken 
together, our per nucleotide and per crosslinking site 
results indicate that high conservation observed for Ts 
with low CLS values is driven by conserved RNA second- 
ary structures, while Ts with high CLS values are located 
in exposed, single-stranded RNA regions available for 
sequence-specific contact with RBPs. 

Large-scale changes of RBP crosslinking site occupancy 
occur upon nutrient deprivation 

To explore RBP-RNA interaction dynamics under chan- 
ging cellular conditions, we compared gPAR-CLIP read 
coverage of individual 3' UTR crosslinking sites between 
glucose or nitrogen starvation and log-phase growth 
conditions. As we selected non-translating mRNAs for 
gPAR-CLIP analyses, we cannot distinguish whether the 
changes in binding site coverage reflect changes in RBP 
binding or changes in RBP distribution in the sucrose 
gradient (see Discussion). We only examined crosslink- 
ing sites with >5 reads per million mapped reads (RPM) 
in gPAR-CLIP libraries to ensure confident quantifica- 
tion (Additional file 13; Materials and methods). The 
intra-replicate variation of crosslinking site read cover- 
age was quantified as standard deviation a = 1.3-fold 
(Additional file 13); therefore, we consider crosslinking 
sites with more than 4-fold (3a) differences in read cov- 
erage between wild type and stress conditions as 
'increased' or 'decreased'. We observed >4-fold changes 
in crosslinking site coverage, also referred to as 'RBP 
occupancy , for 1,129 of 3,803 (30%) 3' UTR sites upon 
glucose starvation and for 535 of 3,932 (14%) 3' UTR 
sites upon nitrogen starvation (1,497 of 3,985 3' UTR 
sites in either condition, 38%) (Figure 5a,b). Similar dis- 
tributions of changes were observed for crosslinking 
sites in 5' UTRs (Additional file 13). Nineteen percent 
(116 of 623) of crosslinking sites that exhibited 
decreased RBP occupancy were affected by both condi- 
tions, while only 5% (40 of 885) of crosslinking sites 
that exhibited increased RBP occupancy were affected 
by both conditions, suggesting that RBP-RNA interac- 
tion changes are largely distinct to glucose or nitrogen 



deprivation (Figure 5c). Similar to the observation that 
glucose starvation induced more crosslinking site occu- 
pancy changes than nitrogen starvation, comparison of 
mRNA abundance revealed more changes in gene 
expression upon glucose than nitrogen starvation (Fig- 
ure 5d,e). Interestingly, mRNA expression of ribosomal 
subunits and other known RBPs was significantly down- 
regulated upon glucose (Welch's ^-test, P < 10'^^) and 
nitrogen (Welch's ^-test, P < 10'^^) starvation, suggesting 
that global suppression of post-transcriptional regulation 
is a general response to nutrient deprivation. 
We next examined the overlap of individual genes with 3' 
UTR crosslinking sites affected by each stress condition 
(Figure 5f). Genes harboring 3' UTR crosslinking sites 
with increased RBP occupancy showed little overlap (41 
genes, 6%) between the two conditions; genes harboring 
crosslinking sites with decreased RBP occupancy showed 
higher overlap (114 gene, 21%). These data suggest that, 
for the non-translated mRNA transcriptome, loss of RBP 
occupancy at crosslinking sites of a larger set of common 
genes is a general response to nutrient limitation while 
increased RBP occupancy at crosslinking sites of distinct 
sets of genes is a nutrient-specific response. 

We determined if genes exhibiting common or distinct 3' 
UTR crosslinking site occupancy changes under nitrogen 
and glucose starvation conditions had shared biological 
functions or cytological localization using Gene Ontology 
(GO) enrichment analysis (Figure 5g,h; Additional file 14). 
When we analyzed the 356 genes with sites decreased in 
RBP occupancy only during glucose starvation, mitochon- 
drion-related genes and genes associated with cellular 
respiration were preferentially affected (Figure 5g, top). 
Analysis of the 77 genes with sites lost only during nitro- 
gen starvation revealed enrichment for ribosomal compo- 
nents and noncoding RNA processing (Figure 5g, middle). 
The 114 genes harboring 3' UTR crosslinking sites with 
decreased coverage under both stress conditions were 
enriched for fatty acid and lipid catabolism (5g, bottom), 
consistent with the utilization of stored lipids as energy 
source in response to nutrient deprivation [45]. 

Analysis of the 400 genes harboring 3' UTR crosslink- 
ing sites with increased occupancy only upon glucose 
starvation were enriched for terms related to translation 
(Figure 5g, top). The 254 genes harboring sites with 
increased RBP occupancy only upon nitrogen starvation 
were enriched for metabolic processes, including gluta- 
mate metabolic processes, which are affected by nitrogen 
availability (Figure 5h, middle). Of the 41 genes harboring 
3' UTR crosslinking sites with increased RBP occupancy 
under both nitrogen and glucose starvation conditions, 
13 (32%) genes represent cellular components of ribo- 
somes or mitochondria (Figure 5h, bottom), consistent 
with induction of global changes through translational 
repression and changes in energy metabolism. 
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Figure 5 Nutrient deprivation induces widespread but distinct RBP-crosslinking and mRNA changes. (a,b) Changes in 3' UTR crosslinking 
site coverage upon glucose (a) or nitrogen (b) starvation. Standard deviations of intra-replicate variation: wild-type {WT), 1.31 -fold; glucose 
starvation, 1.24-fold; nitrogen starvation, 1.15-fold, (c) Overlap of 3' UTR crosslinking site changes affected by glucose or nitrogen starvation 
conditions. (d,e) Global changes in mRNA abundance upon glucose (d) or nitrogen (e) starvation, (f) Overlap of mRNAs with 3' UTR crosslinking 
site changes affected by glucose or nitrogen starvation conditions. (g,h) Enriched Gene Ontology terms for mRNAs with 3' UTR crosslinking sites 
with decreased (g) or increased (h) RBP occupancy upon glucose or nitrogen starvation or both. Grey lines indicate P-value of 0.05. 
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In order to determine whether these observations are 
a result of changes in mRNA abundance, we calculated 
GO term enrichment of mRNAs up- or down- regulated 
upon glucose or nitrogen starvation and observed that 
down-regulated mRNAs are enriched for ribosome- and 
translation-related genes, while up-regulated mRNAs are 
enriched for genes related to mitochondrion and meta- 
bolic processes (Additional file 15). Therefore, the GO 
term enrichment of genes with changes in 3' UTR site 
occupancy cannot be fully explained by GO term 
enrichment of up- or down-regulated mRNAs. Taken 
together, these data indicate that general nutrient limita- 
tion triggers a remodeling of the post-transcriptional 
regulatory programs of metabolic pathways, while glu- 
cose- and nitrogen-specific stresses affect additional, dis- 
tinct biological processes. 

We further visualized changes in RBP occupancy of 3' 
UTR crosslinking sites relative to the changes in corre- 
sponding mRNA abundance induced by glucose starvation 



(Figure 6a; Additional file 16). Since 3' UTR crosslinking 
sites with decreased RBP occupancy were enriched for 
mitochondrion-related genes, we examined sites on a sub- 
set of these genes encoding mitochondrial membrane 
components and observed that the crosslinking sites were 
significantly depleted of RBP occupancy compared to all 3' 
UTR crosslinking sites (Welch's ^-test, P < 10'^^), and the 
mRNAs were significantly up-regulated compared to all 
genes (Welch's t-test, P < 10'^^) (Figure 6a, blue dots). 
This observation suggests that 3' UTR crosslinking sites 
on mRNAs encoding mitochondrial membrane compo- 
nents are recognized by repressive RBPs, and that upon 
glucose deprivation, RBP-binding is attenuated, resulting 
in increased mRNA levels. 

Mitochondrial aldehyde dehydrogenase ALD4 mRNA is 
regulated transcriptionally under stress conditions [46,47]. 
In our gPAR-CLIP data, the ALD4 3' UTR harbors four 
highly conserved crosslinking sites displaying 2- to 8-fold 
decreases in RBP occupancy despite a >7-fold increase in 
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Figure 6 Glucose starvation induces RBP-crosslinking and mRNA changes associated with mitochondrial processes, (a) Changes in 3' 
UTR crosslinking site coverage versus changes in the corresponding mRNA upon glucose starvation. Crosslinking sites on genes annotated with 
the 'mitochondrial membrane' GO term are colored blue. Dotted lines indicate >4-fold changes in crosslinking site coverage (vertical) or >2-fold 
change in mRNA expression (horizontal), (b) ALD4 3' UTR contains four crosslinking sites that decrease two- to eight-fold in RBP occupancy upon 
glucose starvation and overlap with conserved blocks (red diamonds in (a)). ALD4 mRNA expression is up-regulated upon glucose starvation, (c) 
STMl 3' UTR contains one crosslinking site that increases in coverage upon glucose starvation (red diamond in (a)). STMl mRNA expression is 
down-regulated upon glucose starvation. WT, wild type. 
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ALD4 mRNA levels (Figure 6b; Additional file 17). These 
data suggest that post-transcriptional regulation of ALD4 
in response to glucose deprivation also occurs through the 
release of repressive RBP binding at these 3' UTR sites. 
STMl, which encodes a ribosomal subunit-associated pro- 
tein required for optimal translation under nutrient stress 
[48], has two 3' UTR crosslinking sites, with one exhibiting 
>25-fold increased RBP occupancy upon glucose starva- 
tion (Figure 6c; Additional file 17). STMl mRNA is con- 
versely down-regulated >3-fold, indicating a potential 
regulatory role for this site involving mRNA stability and/ 
or decay. Interestingly, STMl mRNA expression is also 
down-regulated upon nitrogen starvation despite no 
change in RBP occupancy of this site, pointing to non- 
overlapping regulatory mechanisms that contribute to 
STMl regulation in glucose and nitrogen starvation 
conditions. 

Next we explored changes in RBP occupancy of 3' UTR 
crosslinking sites relative to changes in corresponding 
mRNA abundance upon nitrogen starvation. 3' UTR 
crosslinking sites on mRNAs associated with ribosome 
biogenesis showed significantly greater decrease in RBP 
occupancy compared to all 3' UTR crosslinking sites 
(Welch's ^-test, P < 10'^^) (Figure 7a, red dots; Additional 
file 18). Inositol-3-phosphate synthase INOl is transcrip- 
tionally regulated under stress [49]. INOl 3' UTR has 
four conserved crosslinking sites, one of which exhibits a 
>50-fold increase in RBP occupancy upon nitrogen star- 
vation despite a > 10-fold decrease in INOl mRNA levels 
(Figure 7b; Additional file 19). These data suggest post- 
transcriptional regulation of INOl mRNA by a specific 
RBP-RNA interaction in the 3' UTR. We also identified 
three crosslinking sites under normal growth conditions 
in the 3' UTR ofAGPS, an amino acid permease capable 
of supplying amino acids as an alternative nitrogen 
source in nitrogen-poor conditions [50]. RBP occupancy 
at these sites was completely lost upon nitrogen starva- 
tion while two additional sites emerged (Figure 7c; Addi- 
tional file 19). AGP3 mRNA levels moderately increased 
approximately two-fold (Figure 7c), suggesting complex, 
combinatorial post-transcriptional regulation of AGP3 
expression in nitrogen-poor conditions. 

Discussion 

RNP complexes exhibit dynamic properties that are sen- 
sitive to environmental conditions. For example, granules 
containing stalled translation pre-initiation complexes 
are formed under stress but rapidly dissociate when the 
cell returns to favorable conditions [51]. Despite insight 
into how particular RNP complexes are affected by stress, 
global effects of stress on all RBP-RNA interactions have 
until now remained unexplored. We detect reproducible 
changes in occupancy for 38% of 3' UTR crosslinking 
sites on non-translating mRNAs under glucose or 



nitrogen starvation conditions: loss of RBP occupancy at 
RBP crosslinking sites was a phenomenon common to 
both glucose and nitrogen stress conditions, while more 
distinct sets of crosslinking sites increased RBP occu- 
pancy (Figure 5c). 

In our current work, we limited our gPAR-CLIP analyses 
to protein-RNA interactions residing in non-translated 
RNPs (Figure la; Materials and methods), which mediate 
important functions for mRNA translation, localization, 
and degradation [52]. Because we have no information on 
the identities of the RBPs or their distribution in the 
sucrose gradient, we cannot distinguish whether the 
changes in RBP coverage represent changes in RBP bind- 
ing and/ or distribution. This is particularly relevant in glu- 
cose or nitrogen starvation, as many RBPs redistribute 
under these conditions [53]. Future comparative gPAR- 
CLIP analyses on both non-translating RNP and translat- 
ing RNPs in stress conditions will distinguish changes in 
RBP binding versus changes in RBP localization. 

RNAs are capable of forming complex two- and three- 
dimensional structures, and some RBPs are known to 
recognize such structural motifs. For example, She2p 
mediates the localization of several bud-localized tran- 
scripts during cell division by recognizing and binding to 
specific stem-loop structures in mRNAs [40,41]. Exami- 
nation of the structural properties of our global RBP 
crosslinking sites revealed a preference for single- 
stranded regions, which agrees with previous reports of 
crosslinking sites of the RNA-binding protein FUS occur- 
ring at single-stranded regions directly adjacent to the 
FUS RNA recognition motif [54]. Unpaired loop and 
bulge regions can be unstructured or form tertiary struc- 
tural modules, both of which can be readily recognized 
by RBPs. In contrast, double-stranded RNAs, in general, 
do not provide good platforms for RBP binding: struc- 
tured RNA regions captured by gPAR-CLIP generally 
had low CLS values (Figure 4), likely resulting from 
crosslinking and/or RNase Tl cleavage inefficiency. In 
structured regions, 4-thiouridines are more likely to be 
locked in U:A or U:G pairing, preventing crosslinking to 
proteins. In addition, structured regions are less accessi- 
ble to RNase attack during sequencing fragment prepara- 
tion, resulting in under-representation in gPAR-CLIP 
libraries. Nevertheless, despite their low crosslinking effi- 
ciencies, Ts in double-stranded, paired RNA regions 
show extremely high conservation compared to Ts with 
no crosslinking evidence. These data indicate that RNAs 
with high secondary structure are evolutionarily con- 
served and can serve as functional, secondary structure 
motifs recognized by select RBPs. 

RBP binding sites functioning as c/5-regulatory ele- 
ments are expected to be under purifying selection. We 
identified a substantial fraction (35%) of conserved ele- 
ments in UTRs overlapping RBP crosslinking sites. This 
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Figure 7 Nitrogen starvation induces specific RBP-crosslinking and mRNA changes associated with ribosomes and translation-related 
processes, (a) Global changes in 3' UTR crosslinking site coverage versus changes in the corresponding mRNA upon nitrogen starvation. 
Crosslinking sites on genes annotated with 'ribosome biogenesis' GO term are colored red. Dotted lines indicate >4-fold changes in crosslinking 
site coverage (vertical) or >2-fold change in mRNA expression (horizontal), (b) INOl 3' UTR contains one crosslinking site that increases in 
coverage upon nitrogen starvation and falls within a conserved block (blue diamond in (a)). INOl mRNA expression is down-regulated upon 
nitrogen starvation, (c) AGP3 3' UTR contains three crosslinking sites that are lost and two crosslinking sites that appear upon nitrogen starvation 
(blue diamonds in (a)). AGP3 mRNA expression is up-regulated upon nitrogen starvation. 



represents an underestimation because RBPs and RNAs 
that are not expressed under our experimental conditions 
or that fail to crosslink will not be captured. Although 
crosslinking sites in general are more highly conserved 
than non-crosslinking sites in UTRs, many sites are not 
well conserved and might represent species-specific cis- 
regulatory elements that allow adaptation to different 
environments and stressors. 

A preference of RBP binding to 3' UTRs observed in this 
study and others [13,14] is consistent with the function 
and evolution of 3' UTRs as major sites for post-transcrip- 
tional regulation. Unlike protein-coding regions, 3' UTRs 
do not directly engage ribosomes during translation and 
therefore provide accessible platforms for RBP binding 
and RNP assembly. One important aspect of gene regula- 
tion is combinatorial control, which allows a single gene 
to be controlled by more than one regulator. In our study, 
23% of all nucleotides in annotated 3' UTRs were located 
within RPB crosslinking sites, corresponding to an average 
of 1 crosslinking site, on average 23 nucleotides long, in 



every 100 nucleotides. For a median-sized yeast 3' UTR 
that is 166 nucleotides long [38], there are, on average, 2 
RBP crosslinking sites, suggesting that most yeast genes 
are subject to combinatorial post-transcriptional regula- 
tion. Since S. cerevisiae lacks post-transcriptional regula- 
tion by the highly conserved and pervasive microRNA 
regulatory pathway, combinatorial regulation by RBPs may 
play a more prominent role than in organisms with small 
RNA-mediated post-transcriptional gene regulation. 

Unlike focused interrogation of individual RBPs, gPAR- 
CLIP does not directly identify the RBP that recognizes 
each crosslinking site. To enable identification of primary 
sequence motifs recognized by individual RBPs, we 
searched gPAR-CLIP crosslinking sites located on target 
mRNAs identified in vitro by RIP-Chip for 29 RBPs 
[12,13] and identified 39 motifs for 15 RBPs (Additional 
file 20; Materials and methods). Notably, 35 of the 
sequence motifs derived by gPAR-CLIP differed signifi- 
cantly from previous motif predictions, which were based 
on scanning whole transcript sequences for enriched 
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k-mers. This discrepancy between primary sequence 
motifs identified by our gPAR-CLIP data and previous 
predictions illustrates the potential utility of deriving 
motifs based on direct in vivo evidence of RBP-RNA inter- 
actions, which narrows the search space to enhance the 
signal of bona fide primary sequence recognition elements. 

Conclusions 

Our study provides a comprehensive map of RBP cross- 
linking sites across the budding yeast non-translating 
mRNA transcriptome and for the first time describes 
the dynamics of mRNA-RBP binding under normal and 
nutrient-limited growth conditions. Delineating in vivo 
sites of RBP binding will aid in directing future studies 
for identification of sites responsive to environmental or 
genetic perturbations, refinement of primary sequence 
and secondary structural elements recognized by specific 
RBPs, and elucidation of the complex network of regula- 
tory processes that contribute to regulation of expres- 
sion of each individual mRNA. gPAR-CLIP is readily 
applicable to other organisms for profiling global RNA- 
protein interactions underlying post-transcriptional reg- 
ulation and the effects of environmental perturbations 
upon these interactions. 

Materials and methods 

Strains, media and growth conditions 

The following strains were used in this study: wild-type 
BY4742 {MATa hisSM leu2 AO lys2 AO uraS AO), TAP- 
tagged strains picked from TAP-tagged yeast strain collec- 
tion (MATa hisSAl leu2A0 met IS AO uraSAO YFG-TAP:: 
HISS) [55]. Strains were grown at 30°C with vigorous shak- 
ing (250 rpm) in synthetic defined media, supplemented 
with 200 |iM 4sU (Sigma- Aldrich 440736; St. Louis, MO, 
USA, to ODeoo = O-^ to 0.8. Starvation was performed by 
pelleting cells for 5 minutes at 3,000Xg at room tempera- 
ture, discarding all media, rinsing once with H2O, and 
resuspending cells in an equal volume of synthetic defined 
media without glucose or nitrogen (supplemented with 
200 [iM 4sU). Cells were returned to 30°C with shaking 
for 2 h. Strains used are defective in uracil synthesis 
{uraSA) and readily take up 4sU from the media. Inside 
the cell, 4sU is converted by Furlp (uracil phosphoribosyl- 
transferase) to 4-thiouridine monophosphate that can be 
incorporated during RNA synthesis. 

Estimation of 4sU incorporation rates 

4sU incorporation rates were measured as described 
[56]. Briefly, RNA samples isolated from cells grown in 
the presence or absence of 4sU were dissolved in 100 \A 
of 12 mM Tris buffer, pH 7, and their A260 absorption 
was adjusted to the same value. A330 was measured for 
both samples using a Q6 quartz cuvette with 1 mm 
light path in a Thermo Scientific BioMate 3 UV-Vis 



spectrophotometer. 4sU incorporation rates per kilobase 
RNA were calculated as 500 x [(A33o(+4sU)) - (A330 
(-4sU))]/A260' 4sU was incorporated at roughly four 4sU 
per kilobase of transcript, with little interference with 
cell growth and only minor changes in gene expression 
(Additional file 3). 

gPAR-CLIP procedures 
UV crosslinking 

Mid-log phase cultures (50 ml; OD^oo of 0.7 to 0.8) were 
pelleted for 5 minutes at 3,000Xg at room temperature, 
resuspended in 2 ml of Ix HBSS (Invitrogen 14025; 
Grand Island, NY, USA) and transferred to a 60 mm cell 
culture dish (BD Biosciences 353002; San Jose, CA, 
USA), placed on ice, and irradiated with 365 nm UV at 
150 mj/cm^ four times using a UVP CL-IOOOL UV cross- 
linker. The cells were then pelleted for 2 minutes at 
5,000Xg at 4°C. After removing Ix HBSS, the cells were 
frozen in liquid nitrogen. 
Extract preparation 

Crosslinked cells were resuspended in polysome lysis buf- 
fer (20 mM HEPES pH 7.5, 140 mM KCl, 1.5 mM MgCl2, 
1% Triton X-100, Ix Complete Mini Protease Inhibitor 
EDTA-free (Roche Applied Science 1 836 170; Indianapo- 
lis, IN, USA), 0.2 U/(il SUPERase In (Invitrogen 
AM2696)), mixed with half a volume of acid -washed glass 
beads, and lysed by vortexing four times at 4°C, 1 minute 
each with a 1 minute incubation on ice in between. Cell 
debris was removed by centrifugation for 5 minutes at 
l,300Xg at 4°C. The supernatant was cleared by 20,000Xg 
spin for 10 minutes at 4°C. 

Ribosome depletion using sucrose density gradients 

Sucrose density gradients (15 to 50% (w/v)) were prepared 
in Beckman polycarbonate centrifugation tubes (11 x 34 
mm) by sequentially layering and freezing 0.24 ml of 50%, 
41.25%, 32.5%, 23.75% and 15% sucrose dissolved in poly- 
some gradient buffer (20 mM HEPES pH 7.5, 140 mM 
KCl, 5 mM MgCl2). Gradients were thawed overnight at 4° 
C before use. 100 [i\ of clarified lysate was loaded on top of 
a gradient, centrifuged for 1 h at 54,000 rpm at 4°C using a 
TLS-55 rotor in an Optima MAX-E ultracentrifuge (Beck- 
man Coulter; Palo Alto, CA, USA). The top 600 \i\ of the 
gradient were recovered and supplemented with 2 \i\ of 
SUPERase In (20 U/(il). 
Chemical biotinylation and polyA selection 
Sixty microliters of freshly prepared 10 mM EZ-Link NHS- 
SS-Biotin (Pierce 21441; Rockford, IL, USA) dissolved in 
dimethylformamide was added to the recovered lysate and 
incubated on a Nutator for 2 h at 4°C; 50 (il of 5 M NaCl 
was added to increase the total salt concentration to 0.5 M. 
Biotinylated lysate was mixed with 1 mg of oligo(dT)25 
magnetic beads (NEB S1419S; Ipswich, MA, USA), then 
incubated on a Nutator for 30 minutes at 4°C. The beads 
were washed four times with ice-cold hybridization buffer 
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(10 mM HEPES pH 7.5, 0.5 M NaCl, 1 mM EDTA) and 
the RNAs were eluted by incubating beads with 500 [d of 
elution buffer (10 mM HEPES pH 7.5, 1 mM EDTA) and 
heating at 65°C for 3 minutes. The eluted sample was 
transferred to a new tube and mixed with 55 \il of lOx 
phosphate-buffered saline (PBS). 
Streptavidin binding and RNase T1 digestion 
PolyA-selected samples were mixed with 1 mg of strepta- 
vidin M280 Dynabeads (Invitrogen 112-05D) and incu- 
bated on a Nutator for 30 minutes at 4°C. The beads 
were washed three times with Ix PBS, then incubated 
with 20 \i\ of 50 U/(il RNase Tl (Fermentas EN0541; 
Waltham, MA, USA, 1:20 dilution in Ix PBS) at 22°C for 
15 minutes on an Eppendorf Thermo mixer (15 s shaking 
at 1,000 rpm followed by a 2 minute rest interval), fol- 
lowed by a 5 minute incubation on ice. Beads were 
washed twice with wash buffer (Ix PBS, 0.1% SDS, 0.5% 
deoxycholate, 0.5% NP-40), twice with high-salt wash 
buffer (5x PBS, 0.1% SDS, 0.5% deoxycholate, 0.5% NP- 
40) and twice with Ix PNK buffer (50 mM Tris pH 
7.4,10 mM MgCl2, 0.5% NP-40). 
On-bead CIP treatment 

Beads were incubated with 20 [i\ of CIP mix (50 mM 
Tris pH 7.9, 100 mM NaCl, 10 mM MgCl2, 0.5 U/[i\ 
calf intestinal alkaline phosphatase (CIP); NEB M0290S) 
at 37°C for 15 minutes, with 15 s shaking at 1,000 rpm 
followed by a 2 minute rest interval on a Thermomixer. 
After CIP treatment, beads were washed twice with Ix 
PNK+EGTA buffer (50 mM Tris pH 7.4, 20 mM EGTA, 
0.5% NP-40) and twice with Ix PNK buffer. 
On-bead 3' DNA linl<er ligation 

Beads were incubated with 20 \i\ of ligation mix (50 mM 
Tris pH 7.4, 10 mM MgCl2, 0.5 mM DTT, 2 \iM pre-ade- 
nylated 3' DNA linker, 25% PEG-8000, 10 U/[i\ T4 RNA 
ligase 2, truncated K227Q; NEB M0351S) at 16°C over- 
night (>16 h), with 15 s shaking at 1,000 rpm followed by 
a 2 minute interval on a Thermomixer. After linker liga- 
tion, beads were washed three times with Ix PNK 
+EGTA buffer. 

SDS-PAGE and nitrocellulose transfer 

Beads were mixed with 12 (il of Ix PNK+EGTA buffer, 3 \i\ 
of freshly made 1 M DTT and 15 [i\ of 4x NuPAGE LDS 
sample buffer (Invitrogen NP0007), and incubated at 70°C 
for 10 minutes. Beads were removed, and the supernatant 
was loaded onto NuPAGE 4-12% Bis-Tris gel (Invitrogen 
NP0335BOX) and run at 150 V for 35 minutes. The gel 
was transferred to Protran BA 85 nitrocellulose membrane 
(pore size 0.45 (im, Whatman 10402594; St. Louis, MO, 
USA) using Novex wet transfer at 30 V for 1 h. A broad 
band from 31 kDa up to the top of the gel was excised, cut 
into small pieces, and transfered into a microfuge tube. 
RNA isolation and purification 

Excised membranes were incubated with 500 (il of 4 
mg/ml Proteinase K prepared in Ix PK buffer (100 mM 



Tris pH 7.5, 50 mM NaCl, 10 mM EDTA) for 20 min- 
utes at 37°C on a Thermomixer. We added 500 [i\ of 
7 M urea prepared in Ix PK buffer to the tube followed 
by another 20 minute incubation at 37°C. The Protei- 
nase K digestion reaction was mixed with 1 ml of phe- 
nol:chloroform:isoamyl alcohol 25:24:1 (Sigma- Aldrich 
P2069) by vortexing and spun for 5 minutes at 
20,000Xg. The liquid phase was transferred into a new 
tube, mixed with 125 (il of 3 M NaOAc, 2.5 ml of 100% 
ethanol and 1 \i\ of 15 mg/ml glycoblue (Invitrogen 
AM9516), and precipitated for 2 h at -80°C. RNAs were 
collected by centrifugation for 20 minutes at 20,000Xg 
at room temperature followed by two washes with cold 
75% ethanol. 

RNA 5' end phosphorylation 

RNA pellets were air-dried briefly, resuspended in 10 (il of 
PNK mix (70 mM Tris pH 7.6, 10 mM MgCl2, 5 mM 
DTT, 1 mM ATP, 1 U/(il T4 polynucleotide kinase (NEB 
M0201S), 1 U/(il SUPERase In) and incubated at 37°C for 
30 minutes. The reaction was combined with 90 (il of 
H2O and 100 (il of phenol:chloroform:isoamyl alcohol 
25:24:1, mixed well and spun for 5 minutes at 20,000Xg. 
The liquid phase was mixed with 12.5 (il of 3 M NaOAc, 
250 \A of 100% ethanol, 1 \A of 15 mg/ml glycoblue and 
precipitated for 2 h at -80°C. RNAs were collected by cen- 
trifugation for 20 minutes at 20,000Xg at room tempera- 
ture, followed by two washes with cold 75% ethanol. 
5' RNA linker ligation 

RNA pellets were resuspended in 10 \A of ligation mix 
(50 mM Tris pH 7.5, 10 mM MgCb, 10 mM DTT, 1 mM 
ATP, 0.1 mg/ml bovine serum albumin, 2 (iM 5' RNA lin- 
ker, 1 U/(il T4 RNA ligase (Fermentas EL0021), 1 U/(il 
SUPERase-In, 10% DMSO) and incubated at 15°C for 2 h. 
RNA size selection 

Ligation reaction was terminated by adding 10 (il of 2x 
formamide gel loading buffer (Invitrogen AM8546G), 
heated for 2 minutes at 70°C and then quickly chilled on 
ice. Samples were loaded onto a 6% TBE UREA gel (Invi- 
trogen EC6865BOX) and run at 150 V for 45 minutes. 
After staining with Ix Sybr Gold Stain (Invitrogen 
S- 11494), a gel piece corresponding to a 70 to 90 nucleo- 
tide RNA (80 to 100 nucleotide single- stranded DNA) was 
excised, crushed, and soaked in 400 (il of 0.3 M NaOAc 
overnight at room temperature. After removing gel pieces, 
the solution was combined with 1 ml of 100% ethanol and 
1 (il of 15 mg/ml glycoblue and precipitated for 2 h at -80° 
C. RNAs were collected by centrifugation for 20 minutes 
at 20,000Xg at room temperature, followed by two washes 
with cold 75% ethanol. After brief drying, RNAs were 
resuspended in 15 (il of H2O. 
RT-PCR 

The ligated RNA (10 (il) was combined with 2 (il of 5 
(iM RT primer, heated at 65°C for 5 minutes, and then 
quickly chilled on ice, and followed by the addition of 
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1 (il of 10 mM dNTP, 1 [i\ of 0.1 M DTT, 4 \i\ of 5x 
First strand buffer, 1 [A of SUPERase-In (20 U/[i\) and 1 
[i\ of Superscript III Reverse transcriptase (Invitrogen 
18080-093, 200 U/(il). The RT reaction was kept at 50°C 
for 45 minutes, 55°C for 15 minutes and 90°C for 5 
minutes. A test PGR was performed with 2.5 \i\ of RT 
product in 50 (il PGR mix: Ix AccuPrime PGR buffer I, 
0.5 [iM P5 long primer, 0.5 [iM P7 primer, 0.2 (il Accu- 
Prime Taq High FideUty (Invitrogen 12346-086, 5 U/(iI). 
PGR was carried out with an initial 3 minute denatura- 
tion at 98°G, followed by 14 to 22 cycles of 80 s dena- 
turation at 98°G, 90 s annealing and extension at 65°G, 
and termination with a final 5 minute extension at 65°G. 
PGR product (15 was collected after 14, 18, and 22 
cycles and analyzed on a 10% TBE gel (Invitrogen 
EG6275BOX) at 150 V for 1 h to determine the optimal 
amplification cycles (the lowest cycle number required 
to generate 96 to 116 bp amplicons detected by Sybr 
Gold staining). 

Preparation of sequencing libraries 

A 50 \i\ PGR reaction was carried out with the deter- 
mined cycle number. Amplicons were purified using 
DNA clean and concentrator-5 (Zymo D4013; Irvine, 
GA, USA), run on 10% TBE gels at 150 V for 1 h and 
stained with Sybr Gold. A gel piece corresponding to 96 
to 116 bp DNA was excised, crushed, and soaked over- 
night in 400 (il 0.3 M NaOAc at room temperature. 
After removing gel pieces, the solution was combined 
with 1 ml of 100% ethanol and 1 (il of 15 mg/ml glyco- 
blue and precipitated for 2 h at -80°G. DNAs were col- 
lected by centrifugation for 20 minutes at 20,000Xg at 
room temperature, followed by two washes with cold 
75% ethanol. After brief drying, amplicons were resus- 
pended in 20 [i\ of H2O. Purified amplicons (5 were 
used to seed a second round of PGR in 50 [ih Ix Accu- 
Prime PGR buffer I, 0.5 [iM lUumina Primer A, 0.5 (iM 
lUumina Primer B, 0.2 \i\ AccuPirme Taq High Fidelity 
for 6 to 12 cycles. Second PGR amplicons were purified 
with DNA clean and concentrator-5 (Zymo D4013) and 
sequenced on an lUumina HiSeq 2000 sequencer. 

Puf3p PAR-CLIP procedures 

Puf3p PAR-GLIP was performed similarly to gPAR-GLIP 
with the following modifications. The PUF3-TAP::HISS 
strain was cultured and UV-crosslinked as in gPAR- 
GLIP. Gells were lysed in Ix PBS, 0.5% NP-40, Ix Gom- 
plete Mini Protease Inhibitor, EDTA-free and cleared by 
sequential spins at l,300Xg for 5 minutes and 20,000Xg 
for 10 minutes at 4°G. The clarified lysate was passed 
through a Gostar Spin-X filter (Gorning GLS8160; St. 
Louis, MO, USA), mixed with RNase Tl (Fermentas 
EN0541) to 1 U/(il, and incubated at 22°G for 15 minutes 
followed by a 5 minute incubation on ice. The lysate was 
then directly mixed with IgG magnetic beads (prepared 



by coupling rabbit IgG (Sigma- Aldrich 15006) to Dyna- 
beads M-270 Expoxy (Invitrogen 143-01)) to pull down 
Puf3p::TAP, RNase Tl digestion, GIP treatment, and 3' 
DNA linker ligation were performed as described in 
gPAR-GLIP. Afterwards, 5' end phosphorylation was per- 
formed on-bead in 20 (il of PNK mix (70 mM Tris pH 
7.6, 10 mM MgGl2, 5 mM DT, 1 \A P32 rATP (6000 Gi/ 
mmol 10 mGi/ml Perkin Elmer BLU502Z500UG; Wal- 
tham, MA, USA), 1 U/(il T4 polynucleotide kinase) and 
incubated at 37°G for 15 minutes. ATP (2 \A of 10 mM) 
was added to the mix and the reaction was incubated for 
10 minutes. After SDS-PAGE and transfer, crosslinked 
RNAs were visualized by autoradiography and the corre- 
sponding Puf3p band was excised. The remaining steps 
were carried out as described in gPAR-GLIP procedures, 
omitting the 5' end phosphorylation step. 

mRNA-seq procedures 

Yeast strains were grown under normal and starvation 
conditions described above in the presence of 4sU. 
Additional replicate mRNA-seq libraries were prepared 
with yeast strains grown under normal conditions with- 
out the addition of 4sU. 

Total RNAs were extracted with acid-phenol:chloro- 
form, pH 4.5 with isoamyl alcohol, 25:24:1 (Ambion; 
Grand Island, NY, USA). Replicate, strand-specific total 
mRNA-seq libraries were prepared in parallel using the 
two linker ligation protocol as described [57]. 

For preparation of ribo- mRNA-seq libraries, extract 
preparation, and ribosome depletion using sucrose den- 
sity gradients were carried out as described in the 
gPAR-GLIP procedure (avoiding UV crosslinking). 
PolyA+ mRNAs were enriched using oligo(dT)25 beads 
and converted into sequencing libraries as described 
[57]. 

Oligos for constructing gPAR-CLIP, PAR-CLIP, and mRNA- 
seq libraries 

Oligos used in this study were synthesized by Integrated 
DNA Technologies, except the 5' RNA linker, which 
was synthesized by Dharmacon (Waltham, MA, USA). 
Barcodes 

3' DNA linker oligos (5' phosphorylated, and 3' block 
with inverted deoxythymindine): 

Index 1: 5' pATCACGTGGTATGGGGTGTTGTGGTT 
GidT 3' 

Index 2: 5' pCGATGTTGGTATGGGGTGTTGTGGT 
TGidT 3' 

Index 3: 5' pTTAGGCTGGTATGGGGTGTTGTGGT 
TGidT 3' 

Index 4: 5' pTGACCATGGTATGGGGTGTTGTGGTT 
GidT 3' 

Index 5: 5' pACAGTGTGGTATGGGGTGTTGTGGT 
TGidT 3' 
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Index 6: 5' p GCCAAT TCGTATGCCGTCTTCTGCT 
TGidT 3' 

Index 7: 5' p CAGATC TCGTATGCCGTCTTCTGC 
TTGidT 3' 

Index 8: 5' p ACTTGA TCGTATGCCGTCTTCTGCT 
TGidT 3' 

Pre-adenylation of 3' DNA linker oligos was per- 
formed with Mth RNA ligase (5' DNA adenylation kit, 
NEB E2610S) following the vendor's instructions. 
5' RNA linker 

5' GUUCAGAGUUCUACAGUCCGACGAUC 3'. 
Barcoded RT primers 

Index 1: 5' CAAGCAGAAGACGGCATACGA CGTGAT 3' 
Index 2: 5' CAAGCAGAAGACGGCATACGA ACA 
TCG_3' 

Index 3: 5' CAAGCAGAAGACGGCATACGA GCCT 
AA_3' 

Index 4: 5' CAAGCAGAAGACGGCATACGA 
TGGTCA 3' 

Index 5: 5' CAAGCAGAAGACGGCATACGA CAC 
TGT 3' 

Index 6: 5' CAAGCAGAAGACGGCATACGAA 
TTGGC 3' 

Index 7: 5' CAAGCAGAAGACGGCATACGA GATC 
TG_3' 

Index 8: 5' CAAGCAGAAGACGGCATACGA TCAA 
GT 3' 
P7 primer 

5' CAAGCAGAAGACGGCATACGA 3'. 
P5 long primer 

5' AATGATACGGCGACCACCGACAGGTTCAGAG 
TTCTACAGTCCGA 3\ 
lllumina primer A 

5' AATGATACGGCGACCACCGA 3'. 
lllumina primer B 

5' CAAGCAGAAGACGGCATACGA 3'. 

Data processing of lllumina HiSeq sequencing reads 

gPAR-CLIP, PAR-CLIP, and mRNA-seq reads were pro- 
cessed to remove linkers, sorted into libraries based on 
six-nucleotide barcodes (underlined residues above), and 
removed if they were low quality (Additional file 2). 
First, reads with a perfect match to a barcode were suc- 
cessfully sorted, followed by reads with one mismatch to 
a barcode. If both barcodes were perfectly matched in a 
read or both barcodes were found with one mismatch, 
the 3'-most barcode was chosen; 99.97% of reads were 
successfully sorted into libraries under these rules. Next, 
reads were removed if they met any of the following cri- 
teria: <18 nucleotides, only homopolymer As, missing 3' 
adapter, 5'-3' adapter ligation products, 5'-5' adapter 
ligation products, low quality (more than 4 bases with 
quality scores below 10 or more than 6 bases with a 
quality score below 13); 97.0% of gPAR-CLIP, 80.0% of 



PAR-CLIP, and 99.7% of mRNA-seq reads passed these 
filters. High quality reads were mapped to the S. cerevi- 
siae genome version S288C with Bowtie [58] using the 
following parameters: -v 3 (map with up to 3 mis- 
matches), -k 275 (map at up to 275 loci), -best, and 
-strata. Mapped reads were annotated using custom 
scripts to known genomic elements in the S288C gen- 
ome (sacCer3, April 2011) including external UTR 
annotations [59,60]. Additional file 1 provides read 
counts at each processing step. 

Assessing data reproducibility 

To determine mRNA-seq and gPAR-CLIP replicate 
library reproducibility, we calculated repUcate correla- 
tion using normalized read counts (RPM) of each gene. 
Pearson correlation coefficients for mRNA-seq libraries 
ranged from 0.984 to 0.994, while coefficients for gPAR- 
CLIP libraries ranged from 0.967 to 0.971. Due to high 
reproducibility, subsequent measures of read coverage 
represent averages of two biological replicate libraries. 
To determine if the addition of 4sU to growth media 
substantially alters transcription, biological replicate 
mRNA-seq libraries were generated from wild-type yeast 
grown under normal conditions without the addition of 
4sU. These replicate libraries had a Pearson correlation 
coefficient of 0.988, indicating high reproducibility, and 
a Pearson correlation coefficient of 0.982 when com- 
pared to wild-type yeast grown in the presence of 4sU. 

Calculation of per-nucleotide crosslinking scores 

To measure RBP crosslinking strength, we calculated a 
crosslinking score for each genomic T position as the 
RPM coverage from reads with a T-to-C at that posi- 
tion. Because transcript abundance varies greatly, from 
zero to tens of thousands of copies, T-to-C coverage of 
crosslinking sites on highly expressed genes would be 
preferentially higher than T-to-C coverage of crosslink- 
ing sites of lowly expressed genes. To avoid this bias, we 
normalized T-to-C RPMs to length-normalized tran- 
script abundances (RPKM) from our mRNA-seq 
libraries. Two percent of Ts with T-to-C RPM coverage 
in gPAR-CLIP libraries were located on genes that 
lacked mRNA-seq coverage and were thus removed 
from further analysis. To adjust for the additional kilo- 
base normalization factor used in RPKM, ratios of 
gPAR-CLIP RPM:mRNA-seq RPKM were multiplied by 
a factor of 1,000. 

Calculation of RBP crosslinking sites 

Generation of read clusters from gPAR-CLIP libraries 

All six gPAR-CLIP libraries were aggregated into one 
large dataset to generate read clusters. A read cluster 
was defined as a continuous stretch of nucleotides cov- 
ered by at least one gPAR-CLIP read harboring one or 
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two T-to-C conversion events. This step resulted in 
84,136 gPAR-CLIP clusters and 1,915 PufSp PAR-CUP 
clusters. 

Defining crosslinl<ing site boundaries 

Manual inspection of read clusters revealed long (>100 
nucleotide) regions covered by gPAR-CLIP reads contain- 
ing one or more distinct peaks indicative of distinct cross- 
linking sites. To distinguish between read peaks within 
long read clusters and trim low read coverage surrounding 
strong single peaks, we fit a Gaussian smoothed curve 
(normal kernel function, bandwidth 21) to each read clus- 
ter and used the inflection points of this curve to define 
the boundaries of individual crosslinking sites. This step 
resulted in 91,290 gPAR-CLIP crosslinking sites and 1,915 
PufSp PAR-CLIP crosslinking sites. 
Calculating read coverage of crosslinking sites 
From the set of RBP crosslinking sites derived from all 
gPAR-CLIP libraries, we determined read coverage for 
each site from each individual library by calculating the 
average RPM covering each nucleotide in the crosslink- 
ing site. This coverage was divided by the RPKM of the 
associated gene and multiplied by 1,000 to enable direct 
comparison of RBP occupancy of crosslinking sites 
between growth conditions. 
Assigning FDR to each crosslinking site 
A small fraction of T-to-C mismatches in gPAR-CLIP 
reads likely represent sequencing error instead of cross- 
linking events, so crosslinking sites derived from this 
error were removed. We repeated the crosslinking site 
generation steps using mRNA-seq reads with one or 
two T-to-C mismatches, which represent the rate of 
T-to-C sequencing error for the lUumina HiSeq plat- 
form. For each gPAR-CLIP and mRNA-seq crosslinking 
site, we calculated the T-to-C conversion rate as the num- 
ber of reads with T-to-C conversion events divided by the 
number of total reads covering Ts. gPAR-CLIP and 
mRNA-seq crosslinking sites were binned into groups 
based on total read coverage. For each gPAR-CLIP cross- 
linking site in each bin, we determined the proportion of 
mRNA-seq crosslinking sites with a higher T-to-C conver- 
sion rate than the gPAR-CLIP crosslinking site. This pro- 
portion represents the FDR for that gPAR-CLIP 
crosslinking site. Using a strict 1% FDR threshold, we 
identify 80,883 gPAR-CLIP crosslinking sites. 

Effect of counting statistics on error in crosslinking site 
coverage measurement 

Read coverage of replicate gPAR-CLIP crosslinking sites 
was analyzed to measure reproducibility. For each site, 
we compared the number of reads coming from one 
replicate library to the total number of reads from both 
libraries. Perfect reproducibility would result in a ratio 
of 1:2. We binned crosslinking sites based on total RPM 
and calculated the standard deviation of these ratios for 



each bin. We predicted the standard deviation for 
counting statistics by binomial partitioning of total reads 
for each crosslinking site in each bin between the two 
replicates. When the total number of reads was below 5 
RPM, binomial partitioning predominantly contributed 
to replicate variation (Additional file 13). Above 5 RPM, 
replicate variation stabilized, and counting statistics 
error contributed little to replicate error. 

Conservation analysis 

phastCons conservation scores for each genomic nucleo- 
tide were downloaded from Siepel et al, [39]. Ts with 
CLSs were grouped into 5' UTR, CDS, and 3' UTR 
regions and then ranked and binned by CLS so each bin 
overlapped adjacent bins by 50%. phastCons scores in 
each bin were averaged. As controls, Ts with no CLS 
were grouped in 5' UTR, CDS, and 3' UTR regions, ran- 
domly ranked, and binned as described. phastCons 
scores in each bin were averaged. Controls were calcu- 
lated ten times for each region. 

Unpaired probability analysis 

The unpaired probability of each genomic position was 
calculated using RNAplfold [42] from the ViennaRNA 
package version 1.8.5 using a span of 40 nucleotides and 
an averaging window of 80 nucleotides. Ts with CLSs 
were grouped into 5' UTR, CDS, and 3' UTR regions 
and then ranked and binned by CLS so each bin over- 
lapped adjacent bins by 50%. Unpaired probabilities in 
each bin were averaged. As controls, Ts with no CLS 
were grouped into 5' UTR, CDS, and 3' UTR regions, 
randomly ranked, and binned. The unpaired probabil- 
ities in each bin were averaged. 

Crosslinking site pairedness analysis 

Genomic regions corresponding to crosslinking sites 
were extended to 80 nucleotides centered on the origi- 
nal crosslinking site. These sequences were subjected to 
folding using RNAfold [61] from the ViennaRNA pack- 
age version 1.8.5, and the minimum free energy struc- 
tures were extracted. Predicted structures were aligned 
and ranked by average crosslinking site CLS and divided 
into 100 equally sized, non-overlapping bins. The per- 
centage of nucleotides predicted to be unpaired at each 
position in each bin was computed. Selected structures 
from low, middle, and high CLS bins were visualized 
using VARNA [62]. 

Enriched motif analysis 

gPAR-CLIP crosslinking sites passing a 5% FDR thresh- 
old from genes identified as RBP targets by RIP-Chip 
experiments [12,13] were analyzed by MEME [63]. 5' 
UTR, CDS, and 3' UTR crosslinking sites were analyzed 
separately, and third-order Markov models based on all 
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5' UTR, CDS, or 3' UTR regions were used to model back- 
ground nucleotide compositions. Because gPAR-CLIP 
crosslinking sites on each target might represent a combi- 
nation of RBP recognition sites, we implemented MEME 
using the -mods zoops parameter to allow zero or one 
motif to be found in each site. The following parameters 
were also used: -evt 20, -minw 6, and -maxw 15. 

Gene Ontology enrichment analysis 

GO analysis was performed on genes harboring 3' UTR 
crosslinking sites that were four-fold up- or down-regu- 
lated upon glucose or nitrogen starvation or both. The 
topGO R Bioconductor package was implemented using 
Fisher's exact test for enrichment and Bonferroni cor- 
rection of P- values to adjust for multiple testing [64]. 
Up to 20 GO terms were reported with a P- value <0.01. 

Data availability 

Raw sequence data are available through the NCBI's Gene 
Expression Omnibus [65] using series entry GSE43747. 

Additional material 



deadenylation, and repression of mRNAs encoding proteins destined for 
the mitochondria [29]. (C) Individual replicate coverage of C0X17 3' UTR 
in gPAR-CLIP with average coverage as shown in Figure 2b. 

Additional file 9: Table of putative Puf3p targets. Data include 
identification of the PufSp recognition motif. 

Additional file 10: Table of crosslinking scores in all gPAR-CLIP 
libraries. 

Additional file 11: Analysis of crosslinking scores and conservation of 
genomic Ts in starvation conditions. (A) Cumulative distribution of CLSs 
from 5' UTR, CDS, and 3' UTR regions. (B) Individual replicate coverage of 
ATG8 and TOM40 3' UTRs in gPAR-CLIP with average coverage as shown 
in Figure 3d. (C) Mean phastCons scores for Ts ranked and binned by 
CLSs. Control lines represent mean phastCons scores of randomly ranked 
and binned Ts with no CLS, repeated ten times. 

Additional file 12: Analysis of RNA secondary structure in starvation 
conditions. (A) Mean unpaired probability scores for Ts ranked and 
binned by CLSs. Control lines represent mean unpaired probability of 
randomly ranked and binned Ts with no CLS, repeated ten times. (B) 
Heatmaps of pairedness of 5' UTR and CDS crosslinking sites ranked by 
average crosslinking site CLS. (C) Percentage of Ts ranked and binned by 
CLSs in conserved secondary structural elements as defined by RNAz 
[43,44]. Control lines represent percentage of randomly ranked and 
binned Ts with no CLS in conserved secondary structural elements, 
repeated ten times. 

Additional file 13: Intra-replicate variation of crosslinking site coverage 
and global changes in 5' UTR crosslinking sites. (A) Determination of 
minimum crosslinking site coverage required for comparison of sites 
across environmental conditions. We chose 5 RPM as the minimum 
crosslinking site coverage needed for confident quantification since, at 
this coverage, the standard deviation of the fraction of crosslinking site 
reads coming from one replicate library stabilized at <0.2. Shown are 
data from wild-type (\A^ replicate libraries; similar results were obtained 
for all library types. (B) Intra-replicate variation of 3' UTR crosslinking sites 
in WT and glucose (left) or nitrogen (right) starvation conditions. Dotted 
lines represent three standard deviations from the mean and correspond 
to approximately four-fold change between WT replicates. (C) Same as 
(A) but for 5' UTR crosslinking sites. (D) Global changes in 5' UTR 
crosslinking site coverage upon glucose (left) or nitrogen (right) 
starvation. 

Additional file 14: Table of enriched Gene Ontology terms. Table 
includes data for genes with more than four-fold increased and/or 
decreased 3' UTR crosslinking sites in nitrogen and/or glucose starvation 
conditions. Only GO terms with P-value <0.001 are reported. 

Additional file 15: Assessment of crosslinking site and mRNA changes 
in starvation conditions. (A,B) Enriched GO terms of genes up- and 
down-regulated upon glucose (A) or nitrogen (B) starvation. (C) The 
number and percentage of 3' UTR crosslinking sites with indicated 
changes in crosslinking site coverage and corresponding mRNA 
expression upon glucose starvation. (D) The number and percentage of 
3' UTR crosslinking sites with indicated changes in crosslinking site 
coverage and corresponding mRNA expression upon nitrogen starvation. 

Additional file 16: Global changes in 3' UTR crosslinking site upon 
glucose starvation. Changes in crosslinking site coverage from one 
replicate library each of wild-type (V\^ and glucose starvation conditions 
are plotted versus changes in the corresponding mRNA from one 
replicate library each of WTand glucose starvation conditions. Dotted 
lines and colors are as in Figure 6a. 

Additional file 17: Changes in 3' UTR crosslinking sites on ALD4 and 
5TM1 upon glucose starvation. Same as Figure 6b,c but showing 
crosslinking site coverage and mRNA expression in individual replicate 
libraries. 

Additional file 18: Global changes in 3' UTR crosslinking site upon 
nitrogen starvation. Changes in crosslinking site coverage from one 
replicate library each of wild-type (WF) and nitrogen starvation 
conditions are plotted versus changes in the corresponding mRNA from 
one replicate library each of WT and nitrogen starvation conditions. 
Dotted lines and colors are as in Figure 7a. 



Additional file 1: Sequencing and mapping statistics. Read counts 
and T-to-C conversion rates for all gPAR-CLIP, mRNA-seq, and PAR-CLIP 
libraries. 

Additional file 2: Pipeline for generating crosslinking scores and 
crosslinking sites. Processing steps used to generate crosslinking scores 
and crosslinking sites from gPAR-CLIP and mRNA-seq data. 

Additional file 3: Computational identification of crosslinking sites. (A) 
Illustration of sequence block generation, Gaussian distribution fitting, 
and cluster segmentation to identify individual crosslinking sites. (B) 
Pearson correlation coefficients for all gPAR-CLIP and mRNA-seq replicate 
libraries based on gene RPM values. (C) Separation of T-to-C sequencing 
errors from crosslinking-induced mismatches. Plotted for each cluster is 
T-to-C RPM coverage versus total RPM coverage from gPAR-CLIP or 
mRNA-seq libraries. (D) Percentage of annotated 5' UTR, CDS, and 3' UTR 
regions with at least one crosslinking site with >5 RPM. 

Additional file 4: Table of gPAR-CLIP crosslinking sites. Data include 
crosslinking site coverage in gPAR-CLIP libraries, FDRs, conservation 
scores, and unpaired probabilities. 

Additional file 5: Visualization of crosslinking site periodicity. Distribution 
of start-to-start nucleotide distances between 5' UTR, CDS, and 3' UTR 
read clusters from gPAR-CLIP and mRNA-seq libraries. Only distances 
from gPAR-CLIP CDS read clusters were enriched for multiples of 3 (red 
dots). 

Additional file 6: Table of gene expression in mRNA-seq libraries. 

Values represent reads per million mapped reads per kilobase of 
transcript (RPKM). 

Additional file 7: Table of Puf3p PAR-CLIP crosslinking sites. Data 
include Puf3p recognition motif and whether the crosslinking site was 
identified in gPAR-CLIP libraries. 

Additional file 8: Analysis and comparison of PAR-CLIP-identified Puf3p 
targets. (A) Puf3p PAR-CLIP identified crosslinking sites in 147 (67%) of 
the 220 Puf3p target mRNA identified by RIP-Chip; 174 Puf3p RlP-Chip- 
identified target mRNAs contain the Puf3p recognition motif UGUAAAUA 
[12,13]. Puf3p PAR-CLIP identified motif-containing crosslinking sites in 76 
(44%) of these mRNAs and in 265 additional mRNAs, suggesting post- 
transcriptional regulation by Puf3p for these 265 novel targets. (B) GO 
enrichment analysis of 265 PAR-CLIP-identified, motif-containing Puf3p 
targets. Results are consistent with Puf3p's role in localization. 
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Additional file 19: Changes in 3' UTR crosslinking sites on INOl and 
AGP3 upon glucose starvation. Same as Figure 7b,c but showing 
crosslinking site coverage and mRNA expression in individual replicate 
libraries. 

Additional file 20: Table of enriched sequence motifs identified by 
MEME. Data include gPAR-CLIP crosslinking sites passing a 5% FDR 
threshold on target mRNAs of known RBPs identified by RIP-Chip. 
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